Corpus: nno_wikipedia_2016_100K

3.6.2 Zipf's law for words of fixed length

Zipf distributions of words of fixed length 4, 6, 8, 10, 12, and 14.


[Figure: Zipf diagram for words of fixed length (Gnuplot plot)]

Top Words of length 4
rank frequency word
1 18228 vart
2 4401 over
3 3874 ligg
4 3640 vert
5 2917 fekk
Top Words of length 6
rank frequency word
1 3732 mellom
2 2748 fleire
3 2436 første
4 1537 namnet
5 1397 songen
Top Words of length 8
rank frequency word
1 674 framleis
2 578 Historie
3 497 menneske
4 491 perioden
5 484 Russland
Top Words of length 10
rank frequency word
1 387 viktigaste
2 372 namngjeven
3 340 1800-talet
4 311 Stortinget
5 226 samstundes
Top Words of length 12
rank frequency word
1 1136 innbyggjarar
2 322 forskjellige
3 139 landmålingar
4 126 temperaturen
5 119 sjølvstendig
Top Words of length 14
rank frequency word
1 198 internasjonale
2 106 administrative
3 99 internasjonalt
4 69 United Kingdom
5 57 organisasjonen
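
The per-length rankings above can be reproduced in outline as follows. This is a minimal sketch, not the procedure actually used for this page: the function name `top_words_by_length` and the sample token list are hypothetical, and it assumes that word length is counted in characters (which would be consistent with the entry "United Kingdom" appearing under length 14, space included).

```python
from collections import Counter


def top_words_by_length(tokens, length, n=5):
    """Return (rank, frequency, word) rows for the n most frequent
    tokens that have exactly `length` characters."""
    counts = Counter(t for t in tokens if len(t) == length)
    # most_common() sorts by descending frequency; equal counts keep
    # the order in which the words were first seen.
    return [
        (rank, freq, word)
        for rank, (word, freq) in enumerate(counts.most_common(n), start=1)
    ]


# Hypothetical toy input; a real run would use the tokenised corpus.
tokens = "vart over vart ligg over vart mellom fleire mellom".split()

for row in top_words_by_length(tokens, 4):
    print(*row)
```

Filtering by length before counting keeps each ranking independent, so rank 1 always refers to the most frequent word within its own length class rather than in the whole corpus.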
Slope by word length
length slope
4 -0.9835028775072241
6 -0.8540215031414027
8 -0.7424235772281392
10 -0.7122074521018298
12 -0.6712113404111031
14 -0.6712113404111031
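
The page does not state how the slopes were computed. A common approach, assumed here, is an ordinary least-squares fit of log(frequency) against log(rank), where a slope near -1 corresponds to the classical Zipf distribution. The sketch below illustrates that approach with the standard library only; the function name `zipf_slope` is hypothetical. Feeding it only the five listed frequencies for a length will not reproduce the reported value, which presumably rests on the full ranked list.

```python
import math


def zipf_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank).

    Frequencies are sorted in descending order and ranked 1, 2, 3, ...
    At least two positive frequencies are required.
    """
    freqs = sorted(frequencies, reverse=True)
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # slope = covariance(x, y) / variance(x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Illustration only: the five listed frequencies for length 4.
print(zipf_slope([18228, 4401, 3874, 3640, 2917]))
```

Fitting in log-log space turns the power law f(r) = C * r^s into the straight line log f = log C + s * log r, so the fitted slope estimates the exponent s directly.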
829 msec needed at 2018-01-09 08:43